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Abstract: Vitis amurensis is a valuable resource for wine production. 
Ripening of the grape berry is the key phase which determines the com¬ 
position of wine. To better understand the gene expression that manifest 
in V amurensis berry skins during the ripening, cDNA library of V 
amurensis berry skins was constructed. A total of 935 high quality ex¬ 
pressed sequence tags (ESTs) were obtained from the library. These 
ESTs represent 636 unigenes, including 108 contigs and 528 singletons. 
The EST analysis was performed and genes were assigned to functional 
categories according to their primary BLAST match. Of these 25.35% 
were involved with metabolism, 6.27% with cell rescue and defense, 
6.84% energy, 11.68% protein synthesis, 18.8% protein activity regula¬ 
tion, 11.11% cell structure, 7.98% transport, 6.27% transcription and the 
remaining 5.7% were signal transduction. The generated ESTs were 
characterized by the gene ontology analysis and were categorized ac¬ 
cording to its cellular component, molecular function and biological 
process. In the cDNA library, some genes are relevant to the biosynthesis 
of anthocyanins, while some genes are related to grape berry maturation. 
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Introduction 

V amurensis (Amur grape), a famous wild-growing berry, is 
widely distributed in China, Korea, and Japan and its fruit has 
been used as the raw material for juice and wine in those three 
countries (Jeong et al. 2010). The wine made by V. amurensis is 
ruby, bright and has a unique flavor which favored by consumers. 
The ripening of grape berries is accompanied by a massive ac¬ 
cumulation of soluble sugars, as well as the synthesis and accu¬ 
mulation of a wide range of phenolic compounds and aroma 
precursors. These processes play major roles in the quality of the 
berries and wine (Agasse et al. 2009). Besides, the skin is the site 
that anthocyanin biosynthesis occurs. During ripening phase, as 
the level of chlorophyll falls, the content of anthocyanins in¬ 
creases which bring distinctive color to colored berry skins and 
red wine. Therefore, it is important to investigate the genes re¬ 
lated to the biosynthesis of anthocyanins in the V. amurensis 
ripen berry skins. 

Analysis of expressed sequence tags (ESTs) is a rapid and 
effective method to identify novel genes or to investigate gene 
expression in different tissues, organs and plants. This technol¬ 
ogy can also identify genes that are involved in specific biologi¬ 
cal functions, especially for organisms whose genome sequences 
are not available (Peng et al. 2007). Gene functions are assigned 
to ESTs based on homology to known proteins from other spe¬ 
cies. Currently, about 50% of ESTs can be identified in this way 
(Ablett et al. 2000). During a specific development states, the 
abundance of ESTs of the cDNA library can be used to estimate 
the expression level of their corresponding genes (Fei et al. 2004; 
Ma et al. 2004). Analysis of the expression of large numbers of 
genes combined with knowledge of their functions allows us to 
perceive the overall picture of biological processes in different 
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cell types. These studies have been initiated by using primary 
BLAST homologues to divide ESTs matching known proteins 
into functional categories (Ablett et al. 2000). To 2011, more 
than 350,000 ESTs have been generated and analyzed for grape¬ 
vine including both wine grape and table grape (Tillett et al. 
2011 ). 

As mentioned above, compared with V. vinifera, the molecular 
biological research of V. amurensis is limited. In order to analyze 
related gene expression in V amurensis ripen berry skins, we 
constructed cDNA library of V amurensis ripen berry skins and 
obtained 935 ESTs from it. These ESTs were assembled into 108 
contigs and 528 singletons and were analyzed and characterized 
by BLAST alignments and gene ontology analysis. 

Materials and methods 

Plant materials 

The cDNA library was derived from the berry skins of V. amu¬ 
rensis cv. Shuang Hong, the berries were collected 16 weeks 
after flowering from Institute of Special Wild Economic Animals 
and Plants, Chinese Academy of Agricultural Science. The berry 
skin and flesh were then separated, frozen immediately in liquid 
nitrogen and stored at -80 °C. RNA was extracted from random 
sub-samples of -80 °C stored berry skin. 

RNA extraction and cDNA library construction 

The total RNA of V. amurensis ripen berry skins was isolated 
according to the protocol of Chang et al. (1993). The quality of 
RNA was analyzed by 1% agarose gel electrophoresis, Total 
RNA from V. amurensis ripen berry skins should appear as two 
bright bands (28S and 18S ribosomal RNA) at approximately 4.5 
and 1.9 kb. The ratio of intensities of the 28S and 18S rRNA 
bands should be 1.5-2.5:1. The cDNA library was constructed as 
described by Creator™ SMART™ cDNA Library Construction 
Kit User Manual and the amount of starting material for cDNA 
synthesis is 0.8 g of total RNA. 

DNA sequencing and data handling 

Prior to sequencing, random colonies were checked for the pres¬ 
ence of an insert by PCR using the 
Ml3 reverse primer and forward primer. The PCR products were 
separated by electrophoresis using 1.2% agarose gels. DNA se¬ 
quencing was completed by Genomics Institute, Beijing, and P.R. 
China. 

Raw single-pass sequence data demonstrating poor quality 
vector sequences or sequences less than 100 bp were removed. 
The remaining sequences were analyzed. Unigenes were ob¬ 
tained by Phrap assembly program. Unigenes were compared 
against the current non-redundant (nr) protein database at the 
National Center for Biotechnology Information (NCBI, Bethesda, 
MD, USA) using the BLASTX algorithm and the NCBI EST 
database using the BLASTN algorithm (Altschul et al. 1997). 
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Results and discussions 

cDNA library characterization 

The titer of primary cDNA library and amplified library was 
1.001 xlO 6 pfu-mL' 1 and 2.542xl0 9 pfumL' 1 , respectively. The 
cDNA clones selected randomly from the primary cDNA library 
were checked by PCR, and 94% were found to contain inserts 
with the size ranging from 0.3 to 2 kb. The mean insert size is 
0.86 kb, which is shorter than those from grapevine cDNA li¬ 
braries previously reported (e.g. 1.3 kb mean insert size in Ter¬ 
rier et al. 2001; 1.2 kb-1.5 kb mean insert size of six cDNA li¬ 
braries of six different organs in Moser et al. 2005). 

After removing vectors and low-quality sequences, we ob¬ 
tained a total of 935 high-quality EST sequences from the library. 
The average length of our EST sequences was approximately 
416 bp (range 102 bp-611 bp), which is shorter than those ob¬ 
tained from grapevine cDNA libraries (e.g. 463 bases in Moser et 
al. 2005; 527 bases in da Silva et al. 2005; 622 bases in Peng et 
al. 2007). However, the majority of our ESTs (>88%) were 
longer than 100 bases. In conclusion, these results indicated that 
the cDNA library is a reliable resource for EST analysis. 

Analysis of EST sequences 

The sequences of the V. amurensis EST clones were submitted to 
NCBI as dbEST IDs 69043327-69044261 and GenBank acces¬ 
sion Nos.GW665582-GW666516. After sequence comparison 
and analysis, we obtained 636 unique sequences, including 528 
single-sequence and 108 multiple-sequence contigs with the 
frequency of occurrence of uniseqs ranging from 2 to 41. A con- 
tig is a group of cDNAs that share sequence identity and are 
considered to represent transcripts of the same gene. Sequencing 
only from the 5’ terminus may leave some redundancy unde¬ 
tected. It is, therefore, possible that a gene may be represented by 
more than one contig (Sacadura and Saville 2003). 

The EST sequences were compared to the NCBI Nr database 
using the BLASTX algorithms (Altschul et al. 1997). The re¬ 
sult showed that a total of 522 unigenes (707 ESTs, 75.6%) ex¬ 
hibited high similarities (E value <10 5 ) to the genes sequences 
available in the Nr database, while the remainingll4 unigenes 
(228 ESTs, 24.4%) showed little or no similarity. Further study 
needs to be carried out to determine whether these non-matching 
clones are new members of known gene families or novel genes. 
ESTs with similarity scores of E value <10' 5 were grouped into 9 
categories according to their functional annotation which in¬ 
volved in metabolism, cell resistance and defense, energy, pro¬ 
tein synthesis, protein activity regulation, cell structure, transport 
and transcription (Fig. 1). 

With the variation of developmental stages, and physiologi¬ 
cal and pathological states, different cells and organs of a certain 
organism have different expression types and abundance of a 
certain gene. The EST numbers of the same gene represent ex¬ 
pression abundance of the gene in the specific organs (Audio and 
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Claverie 1997). In this cDNA library, approximately 95.9% of 
the ESTs belonged to sequence clusters with low redundancy 
(2-3 sequences per contig), while medium redundancy (4-9 se¬ 
quences per contig) and high redundancy (>10 sequence per 
contig) ESTs represented 3.3% and 0.7%, respectively (Fig. 2). 
This result implies that there is a considerable potential to dis¬ 
cover novel sequences from our cDNA library (Jung et al. 2003). 
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Fig. 1 Functional category of ESTs sequences in V. amurensis library 


V. amurensis and the process involves the coordination of a large 
number of events. Some metabolic activities that occur prior to 
veraison, such as photosynthesis and organic acid accumulation, 
are either turned off at veraison or at least down regulated. Other 
processes, such as the accumulation of anthocyanins in berry 
skins, commence at veraison (Davies and Bottcher 2009). 


Table 1 Organisms showing high sequence similarity with the V. 
amurensis ESTs 


Organism 

No. clones 

Percentage (%) 

Vitis vinifera 

171 

32.8% 

Arabidopsis thaliana 

130 

24.9% 

Oryza sativa subsp. japonica 

22 

4.2% 

Glycine max 

10 

1.9% 

Nicotiana tabacum 

8 

1.5% 

Petunia hybrida 

7 

1.3% 

Triticum aestivum 

7 

1.3% 

Zea mays 

7 

1.3% 

Solanum lycopersicum 

6 

1.1% 

Solanum tuberosum 

6 

1.1% 

Ricinus communis 

4 

0.8% 

Catharanthus roseus 

4 

0.8% 

Citrus sinensis 

4 

0.8% 

Prunus avium 

4 

0.8% 



Occurrence 


Fig. 2 Genes expressed frequency 


A number of V amurensis ESTs showed a great sequence si¬ 
milarity with ESTs from V vinifera (171 clones, 32.8%, Table 1) 
and Arabidopsis thaliana (130 clones, 24.9%, Table 1). The 
following organisms are Oryza sativa subsp. japonica, Glycine 
max, Nicotiana tabacum, Petunia hybrid, Triticum aestivum, Zea 
mays according to priority of similarity. 

By BLAST searching of NCBI databases, some ESTs are 
shown to share significant sequence similarity (E value <10' 15 ) to a 
variety of known genes or gene families. In this study, there are 
142 unigenes that had significant similarity to a variety of known 
genes or gene families (Table 2). For example, V. amurensis EST 
clone number 1-B09 was similar to chalcone synthase (97% 
identity), 04-D09 was similar to phenylalanine ammonia-lyase 
(95% identity), and contingl32 was similar to ripening-related 
protein grip22 (91% identity). 

In the cDNA library, the number of function genes, together 
with gene expression abundance, depends on the characteristic of 
the growth stage of a plant. The sampling time is the ripening of 


The most frequent gene found in the cDNA library was ribo- 
somal protein (28 clones, 5.38%). This result was expected be¬ 
cause ribosomal protein genes are expressed ubiquitously at all 
stages of development (Kim et al. 2006). All protein synthesis 
needs to carry on the ribosome and the number of ribosomes 
influences the protein synthesis directly. This result indicated 
that the process of protein synthesis is active in the V amurensis 
berry skins. 

There are a number of different classes of flavonoids found in 
plants and anthocyanin is one of the most important pigments 
enduing grapes with colored skins, and giving red wine distinc¬ 
tive colors (Boss et al. 1996a). Anthocyanins are located in the 
skin cells, which present as a free, non-complex form inside the 
vacuoles (Ortega-Regules et al. 2008). The anthocyanin biosyn¬ 
thesis is vigorous at the ripening stage of grape. In the cDNA 
library we found some genes correlated with the biosynthesis of 
anthocyanins, such as chalcone synthase (CHS), flavon- 
oid-3’,5’-hydroxylase (F3’5’H), phenylalanine ammonia-lyase 
(PAL), glutathione ^-transferases (GST) and anthocya- 
nidin-3 -glucosyltransferas. 

PAL is the first enzyme involved in anthocyanin production, 
which catalyses the synthesis of cinnamic acid from phenyla¬ 
lanine (Boss and Davies 2009). In the growth process of red 
grape berries, the expression of PAL mainly occurred in the 
grape berry skins after flower 2 to 4 weeks and then decreased. 
The expression of PAL reached to climax during ripening (Boss 
et al. 1996a; Kobayashi et al. 2001). There are 15-20 members in 
grape PAL gene families (Sparvoli et al. 1994). The nucleotide 
sequence and the corresponding amino acid sequence of PAL are 
highly similar among different plants. In the cDNA library the 
PAL ESTs are shown to share significant sequence similarity 
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(E V aiue— 10" 15 ) to Arabidopsis thaliana and Petroselinum crispum molecule probably product different function genes (Sparvoli et 
(Table 2). This gene family possibly stems from an ancient gene, al. 1994). 
and the duplication of the gene together with the divergence of 


Table 2. Unigenes have high similarity to a variety of known genes or gene families 


Clone No. 

Putative homologue 

Accession 

No. 

Organism 

E-value 

Score 

(bits) 

Identity 

(%) 

ContiglOl 

Polygalacturonase inhibitor 

A7PW81 

Vitis vinifera 

2.00E-27 

120 

100 

Contigl08 

Agamous-like MADS-box protein AGL9 homolog 

Q03489 

Petunia hybrida 

2.00E-75 

281 

95 

Contigll 

ADP-ribosylation factor 2 

P51823 

Oryza sativa subsp .japonica 

3.00E-39 

160 

100 

Contigll3 

Ubiquitin-conjugating enzyme E2-17 kDa 

P35135 

Solanum lycopersicum 

7.00E-82 

302 

95 

Contigll9 

Glyceraldehyde-3-phosphate dehydrogenase, cytosolic 

P25861 

Antirrhinum majus 

3.00E-52 

204 

90 

Contigl2 

Glucan endo-l,3-beta-glucosidase 

P52408 

Antirrhinum majus 

4.00E-26 

116 

76 

Contigl21 

Chalcone synthase 1 

Q9XJ58 

Citrus sinensis 

6.00E-39 

159 

80 

Contigl3 

Flavonoid 3',5'-hydroxylase 2 

P48419 

Petunia hybrida 

2.00E-74 

278 

80 

Contigl31 

Glutathione V-transferase 

Q96324 

Arabidopsis thaliana 

3.00E-69 

261 

58 

Contigl32 

Ripening-related protein grip22 

Q9M4H4 

Vitis vinifera 

1.00E-118 

426 

91 

Contigl33 

Uncharacterized mitochondrial protein AtMg00030 

P93276 

Arabidopsis thaliana 

4.00E-28 

124 

96 

Contigl34 

Peptidyl-prolyl cis-trans isomerase 

Q39613 

Catharanthus roseus 

2.00E-36 

151 

91 

Contigl5 

Glyceraldehyde-3-phosphate dehydrogenase, cytosolic 

P26518 

Magnolia liliiflora 

2.00E-29 

127 

87 

Contigl6 

Histone H4 

Q76H85 

Silene latifolia 

2.00E-39 

157 

97 

Contigl7 

ATP synthase subunit epsilon, mitochondrial 

Q96253 

Arabidopsis thaliana 

9.00E-27 

119 

87 

Contigl9 

Flavonoid 3',5'-hydroxylase 2 

P48419 

Petunia hybrida 

6.00E-18 

89.4 

80 

Contig29 

GTP-binding nuclear protein Ran-B 1 

P41919 

Nicotiana tabacum 

2.00E-66 

252 

97 

Contig30 

Ubiquitin-like protein 5 

Q9FGZ9 

Arabidopsis thaliana 

2.00E-35 

148 

95 

Contig31 

Alcohol dehydrogenase 1 

P25141 

Petunia hybrida 

1.00E-144 

509 

81 

Contig32 

Vacuolar-processing enzyme 

P49043 

Citrus sinensis 

1.00E-61 

234 

82 

Contig34 

Peptidyl-prolyl isomerase FKBP12 

004287 

Vicia faba 

3.00E-53 

206 

83 

Contig38 

Polygalacturonase 

P35336 

Actinidia deliciosa 

2.00E-39 

161 

65 

Contig40 

14-3-3-like protein B (Fragment) 

Q96451 

Glycine max 

1.00E-44 

178 

91 

Contig45 

Ubiquitin 

P69326 

Triticum aestivum 

4.00E-36 

150 

100 

Contig47 

Probable aquaporin PIP 1-2 

Q7XSQ9 

Oryza sativa subsp .japonica 

2.00E-61 

234 

91 

Contig5 

Histone H4 

Q76H85 

Silene latifolia 

5.00E-40 

162 

100 

Contig54 

Glyceraldehyde-3-phosphate dehydrogenase, cytosolic 

P26518 

Magnolia liliiflora 

2.00E-63 

241 

81 

Contig55 

Phenylalanine ammonia-lyase 

P35510 

Arabidopsis thaliana 

3.00E-84 

311 

78 

Contig57 

Ubiquitin 

P69326 

Triticum aestivum 

3.00E-36 

150 

100 

Contig58 

40S ribosomal protein S26-3 

Q9FYK9 

Arabidopsis thaliana 

6.00E-16 

83.2 

86 

Contig6 

ADP-ribosylation factor 2 

P51823 

Oryza sativa subsp .japonica 

1.00E-61 

235 

99 

Contig60 

Pyruvate kinase, cytosolic isozyme 

Q42806 

Glycine max 

3.00E-61 

235 

90 

Contig67 

Ubiquitin-like protein SMT3 

P55852 

Arabidopsis thaliana 

3.00E-42 

171 

88 

Contig71 

Ras-related protein ARA-1 

P19892 

Arabidopsis thaliana 

6.00E-79 

293 

84 

Contig73 

30S ribosomal protein S7, chloroplastic 

Q0ZIV9 

Vitis vinifera 

4.00E-60 

229 

100 

Contig76 

Transcription elongation factor 1 homolog 

Q8FHP0 

Oryza sativa subsp .japonica 

2.00E-31 

135 

88 

Contig77 

Isocitrate dehydrogenase [NADP], chloroplastic (Fragment) 

Q40345 

Medicago sativa 

1.00E-120 

432 

89 

Contig78 

Peptide methionine sulfoxide reductase (Fragment) 

P54153 

Solanum lycopersicum 

4.00E-80 

297 

80 

Contig8 

Uncharacterized mitochondrial protein AtMg00030 

P93276 

Arabidopsis thaliana 

9.00E-29 

125 

98 

Contig80 

ATP synthase subunit epsilon, mitochondrial 

Q96253 

Arabidopsis thaliana 

1.00E-26 

119 

85 

Contig82 

Probable glutathione V-transferase 

Q03666 

Nicotiana tabacum 

1.00E-82 

306 

72 

Contig86 

60S ribosomal protein F27a-3 

P49637 

Arabidopsis thaliana 

1.00E-72 

271 

86 

Contig88 

60S ribosomal protein F35a-3 

Q9C912 

Arabidopsis thaliana 

4.00E-56 

217 

91 

Contig91 

50S ribosomal protein F2, chloroplastic 

Q0ZIX7 

Vitis vinifera 

2.00E-72 

273 

99 

Contig93 

Fructose-bisphosphate aldolase cytoplasmic isozyme 

P17784 

Oryza sativa subsp .japonica 

1.00E-57 

222 

90 

Contig96 

Glyceraldehyde-3-phosphate dehydrogenase, cytosolic 

P25861 

Antirrhinum majus 

5.00E-58 

223 

88 

Contig97 

Glutamine synthetase cytosolic isozyme 1 

P51118 

Vitis vinifera 

2.00E-70 

265 

99 

0001 A02 

14-3-3 protein 4 

P42652 

Solanum lycopersicum 

4.00E-65 

246 

93 

0001B09 

Chalcone synthase 

P51090 

Vitis vinifera 

2.00E-79 

294 

97 

0001 D04 

Isopentenyl-diphosphate Delta-isomerase I 

048964 

Camptotheca acuminata 

2.00E-44 

177 

91 
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Continue Table 2 


Clone No. 

Putative homologue 

Accession 

No. 

Organism 

E-value 

Score 

(bits) 

Identity 

(%) 

0001 

Gil 

Monodehydroascorbate reductase 

Q40977 

Pisum sativum 

1.00E-49 

194 

80 

0001 

H12 

Histone H2A 

Q9M531 

Euphorbia esula 

7.00E-17 

85.9 

100 

0002 

B01 

Ras-related protein Rab7 

024461 

Prunus armeniaca 

1.00E-41 

169 

95 

0002 

C01 

24-methylenesterol C-methyltransferase 2 

Q39227 

Arabidopsis thaliana 

2.00E-85 

314 

89 

0002 

C03 

Elongation factor 1-alpha 

P25698 

Glycine max 

3.00E-26 

117 

94 

0002 

D12 

Shaggy-related protein kinase zeta 

Q39010 

Arabidopsis thaliana 

1.00E-86 

318 

91 

0002 

F09 

T-complex protein 1 subunit epsilon 

P54411 

Avena sativa 

3.00E-18 

90.1 

89 

0002 

GO 5 

Acyl-CoA-binding protein 

004066 

Ricinus communis 

8.00E-18 

89 

87 

0002 

GO 6 

Uncharacterized FAM18-like protein Atlg09330 

Q8LEK2 

Arabidopsis thaliana 

2.00E-76 

284 

80 

0003 

D10 

Probable aquaporin PIP2-5 

Q9SV31 

Arabidopsis thaliana 

2.00E-21 

101 

83 

0003 

Ell 

Enolase 

Q42971 

Oryza sativa subsp .japonica 

6.00E-82 

303 

86 

0004 

AO 5 

Pyrophosphate—fructose 6-phosphate 1-phosphotransferase subunit alpha 

Q41140 

Ricinus communis 

2.00E-80 

298 

80 

0004 

B02 

Eukaryotic peptide chain release factor subunit 1-3 

P35614 

Arabidopsis thaliana 

5.00E-46 

182 

85 

0004 

B03 

40S ribosomal protein S12 

Q9XHS0 

Hordeum vulgare 

3.00E-40 

164 

81 

0004 

B08 

F-box protein GID2 

Q9STX3 

Arabidopsis thaliana 

2.00E-27 

122 

80 

0004 

Cll 

60S ribosomal protein L29-1 

Q9M7X7 

Arabidopsis thaliana 

5.00E-25 

112 

83 

0004 

C12 

Cytochrome P450 77A1 (Fragment) 

P37123 

Solanum melongena 

1.00E-22 

104 

82 

0004 

D09 

Phenylalanine ammonia-lyase 1 

P24481 

Petroselinum crispum 

3.00E-85 

313 

95 

0004 

DIO 

60S ribosomal protein L3 

P35684 

Oryza sativa subsp. japonica 

4.00E-17 

86.7 

85 

0004 

D12 

Eukaryotic translation initiation factor 1A 

P56331 

Onobrychis viciifolia 

7.00E-51 

199 

95 

0004 

F12 

3-ketoacyl-CoA thiolase 2, peroxisomal 

Q56WD9 

Arabidopsis thaliana 

2.00E-38 

157 

90 

0004 

G02 

Ferritin-3, chloroplastic 

Q948P6 

Glycine max 

5.00E-37 

120 

84 

0004 

G10 

60S ribosomal protein L7-3 

P60039 

Arabidopsis thaliana 

7.00E-62 

235 

83 

0005 

B03 

SUMO-conjugating enzyme UBC9 

P35132 

Arabidopsis thaliana 

1.00E-70 

265 

98 

0005 

Bll 

Yadcnosyl methionine synthetase 4 

A7PRJ6 

Vitis vinifera 

1.00E-93 

341 

92 

0005 

C02 

Ubiquitin-conjugating enzyme E2 2 

P35130 

Medicago sativa 

2.00E-85 

314 

97 

0005 

D01 

Eukaryotic translation initiation factor 4E type 3 

Q9FK59 I 

Arabidopsis thaliana 

3.00E-90 

330 

81 

0005 

E06 

ADP-ribosylation factor 2 

P51823 

Oryza sativa subsp. japonica 

2.00E-76 

285 

97 

0006 

All 

DNA-damage-repair/toleration protein DRT102 

Q05212 

Arabidopsis thaliana 

4.00E-34 

143 

84 

0006 

B01 

Hydroxymethylglutaryl-CoA synthase 

P54873 

Arabidopsis thaliana 

8.00E-88 

322 

83 

0006 

B02 

Methionine aminopeptidase 2B 

Q56Y85 

Arabidopsis thaliana 

5.00E-25 

113 

94 

0006 

B04 

Protein translation factor SUI1 homolog 

Q0D5W6 J 

Oryza sativa subsp .japonica 

8.00E-55 

213 

93 

0006 

B06 

Stress-related protein 

Q9SW70 

Vitis riparia 

1.00E-85 

314 

96 

0006 

Bll 

Elongation factor 1-alpha 

P25698 

Glycine max 

2.00E-63 

241 

95 

0006 

C06 

Shaggy-related protein kinase eta 

Q39011 

Arabidopsis thaliana 

7.00E-26 

116 

80 

0006 

CIO 

60S ribosomal protein L23 

P49690 

Arabidopsis thaliana 

1.00E-56 

218 

98 

0006 

C12 

60S ribosomal protein L24 

Q9FUL4 

Prunus avium 

9.00E-48 

188 

96 

0006 

D02 

Ubiquitin-conjugating enzyme E2-17 kDa 

P35135 

Solanum lycopersicum 

4.00E-83 

306 

97 

0006 

D12 

60S ribosomal protein L32-1 

P49211 

Arabidopsis thaliana 

6.00E-20 

95.9 

92 

0006 

E01 

Cell division cycle protein 48 homolog 

P54774 

Glycine max 

3.00E-21 

100 

85 

0006 

E08 

Hemoglobin-2 

P23244 

Casuarina glauca 

1.00E-68 

258 

84 

0006 

F01 

Caffeic acid 3-O-methyltransferase 

Q43609 

Prunus dulcis 

2.00E-93 

341 

83 

0006 

HO 3 

Uncharacterized protein At5g 10860, mitochondrial 

Q9LEV3 

Arabidopsis thaliana 

3.00E-84 

310 

82 

0006 

HO 7 

Anthocyanidin 3-O-glucosyltransferase (Fragment) 

P51094 

Vitis vinifera 

4.00E-30 

129 

100 

0007 

AO 5 

FAM10 family protein At4g22670 

Q93YR3 

Arabidopsis thaliana 

9.00E-42 

169 

88 

0007 

B03 

GTP-binding protein SARI A 

004834 

Arabidopsis thaliana 

3.00E-31 

134 

93 

0007 

CIO 

Cytochrome c oxidase subunit 1 

P20681 

Podo ora anserina 

7.00E-93 

339 

90 

0007 

DO 6 

Ubiquitin 

P69326 

Triticum aestivum 

4.00E-36 

150 

100 

0007 

E10 

A aragine synthetase [glutamine-hydrolyzing] 

Q43011 

Oryza sativa subsp. japonica 

4.00E-52 

203 

80 

0007 

HOI 

Thiazole biosynthetic enzyme, chloroplastic 

023787 

Citrus sinensis 

4.00E-19 

93.6 

84 

0007 

H04 

Thiazole biosynthetic enzyme, chloroplastic 

023787 

Citrus sinensis 

1.00E-74 

278 

85 

007-G06 

Pectinesterase 

Q9LVQ0 

Arabidopsis thaliana 

9.00E-61 

231 

76 

0008 

AO 5 

GTP-binding nuclear protein RanlB (Fragment) 

P54766 

Lotus japonicus 

2.00E-36 

150 

80 

0008 

_A07 

Actin-depolymerizing factor 

Q8SAG3 

Vitis vinifera 

5.00E-58 

223 

91 

0008 

A10 

BURP domain-containing protein 3 

Q942D4 

Oryza sativa subsp. japonica 

3.00E-19 

93.6 

80 
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Continue Table 2 


Clone No. 

Putative homologue 

Accession 

No. 

Organism 

E-value 

Score 

(bits) 

Identity 

(%) 

0008 

All 

Probable cellulose synthase A catalytic subunit 8 [UDP-forming] 

Q84ZN6 

Oryza sativa subsp .japonica 

2.00E-78 

291 

88 

0008 

B06 

VAMP-like protein YKT61 

Q9ZRD6 

Ricinus communis 

7.00E-79 

292 

86 

0008 

E03 

Vesicle-associated membrane protein 722 

P47192 

Arabidopsis thaliana 

3.00E-87 

320 

80 

0008 

E08 

Pyruvate kinase, cytosolic isozyme 

Q42806 

Glycine max 

4.00E-57 

219 

91 

0008 

F02 

Leucoanthocyanidin dioxygenase 

P51091 

Malus domestica 

2.00E-70 

263 

81 

0008 

HO 5 

UPF0497 membrane protein 7 

A7PA04 

Vitis vinifera 

9.00E-46 

181 

100 

0008 

HO 7 

Elongation factor 1-gamma 

Q9FUM1 

Prunus avium 

1.00E-77 

288 

89 

0009 

A02 

Cysteine proteinase (Fragment) 

P05993 

Carica papaya 

9.00E-24 

108 

90 

0009 

_A08 

5-methyltetrahydropteroyltriglutamate—homocysteine methyltransferase 

Q42662 

Solenostemon scutellarioides 

7.00E-95 

345 

96 

0009 

C04 

50S ribosomal protein L33, chloroplastic 

Q0ZIZ9 

Vitis vinifera 

4.00E-32 

137 

95 

i 

CTn 

O 

O 

O 

D02 

Catalase isozyme 1 (Fragment) 

P49315 

Nicotiana plumb aginifolia 

2.00E-60 

231 

88 

0009 

Dll 

Fight-induced protein, chloroplastic 

P80471 

Solanum tuberosum 

6.00E-44 

176 

84 

0009 

E06 

Ribosomal protein SI9, mitochondrial 

P27527 

Petunia hybrida 

9.00E-36 

149 

80 

0009 

F02 

UTP—glucose-1-pho hate uridylyltransferase 

064459 

Pyrus pyrifolia 

2.00E-91 

334 

89 

0009 

F06 

Putative multidrug resistance-associated protein 15 

Q7FB56 

Arabidopsis thaliana 

1.00E-53 

208 

86 

0009 

G10 

GTP-binding protein SARI A 

004834 

Arabidopsis thaliana 

2.00E-53 

171 

95 

0009 

Hll 

F-box protein ORE9 

Q9SIM9 

Arabidopsis thaliana 

5.00E-36 

149 

84 

0010 

A04 

Heat shock protein 81-1 

Q0J4P2 

Oryza sativa subsp. japonica 

3.00E-55 

213 

94 

0010 

A12 

Proteasome subunit beta type-6 

Q8FD27 

Arabidopsis thaliana 

9.00E-30 

129 

86 

0010 

C04 

Pyropho hate-energized vacuolar membrane proton pump 

P21616 

Phaseolus aureus 

3.00E-28 

123 

91 

0010 

Cll 

Translationally-controlled tumor protein homolog 

Q43847 

Solanum tuberosum 

6.00E-21 

100 

81 

0010 

DO 8 

Superoxide dismutase [Cu-Zn] 4AP 

P23346 

Zea mays 

2.00E-15 

80.9 

94 

0010 

E07 

Uncharacterized protein At5g01610 

Q9M015 

Arabidopsis thaliana 

1.00E-45 

181 

82 

0010 

E09 

Obtusifoliol 14-alpha demethylase (Fragment) 

P93596 

Triticum aestivum 

3.00E-65 

246 

87 

0010 

F10 

Alpha-soluble NSF attachment protein 

P93798 

Vitis vinifera 

6.00E-36 

149 

91 

0010 

G01 

60S ribosomal protein FI7-2 

P51413 

Arabidopsis thaliana 

6.00E-72 

269 

88 

0011 

C02 

Multidrug resistance-associated protein 6 

Q8FGU1 

Arabidopsis thaliana 

2.00E-43 

174 

88 

0011 

C06 

Cell division protease ftsH homolog 1, chloroplastic 

Q5Z974 

Oryza sativa subsp. japonica 

1.00E-100 

363 

96 

0011 

D03 

Translationally-controlled tumor protein homolog 

Q5J907 

Elaeis guineensis var. tenera 

6.00E-33 

139 

89 

0011 

F01 

60S ribosomal protein F37a 

Q9XHE4 

Gossypium hirsutum 

5.00E-47 

186 

98 

0011 

F07 

Obtusifoliol 14-alpha demethylase (Fragment) 

P93596 

Triticum aestivum 

5.00E-50 

196 

90 

0011 

GO 8 

60S ribosomal protein FI 1-1 

P42795 

Arabidopsis thaliana 

7.00E-16 

82.4 

90 

0011 

Gil 

Uncharacterized protein At2g23090 

064818 

Arabidopsis thaliana 

1.00E-30 

132 

86 

0011 

G12 

Calmodulin 

P62201 

Lilium longiflorum 

5.00E-72 

269 

99 

0011 

H05 

Aquaporin PIP 1-3 

Q08733 

Arabidopsis thaliana 

1.00E-63 

241 

88 

0012 

C03 

Expansin-A4 

Q0DHB7 

Oryza sativa subsp. japonica 

2.00E-76 

284 

82 

0012 

G06 

Calreticulin 

P93508 

Ricinus communis 

6.00E-62 

236 

82 


CHS is the first enzyme in the flavonoid biosynthetic pathway 
and chalcone is the first flavonoid produced (Boss and Davies 
2009). In the growth process of red grape berries, the expression 
of CHS mainly occurred in the grape berry skins after flower 2 to 
4 weeks and then decreased. The expression of PAL reached to 
climax during ripening (Boss et al. 1996a; Kobayashi et al. 2001; 
Waters et al. 2005). The CHS of grape was coded by multiple 
genes families and there are 3-4 members involved. The CHS 
EST involved in the cDNA library is shown to share significant 
sequence similarity (E value <10 15 ) to Arabidopsis thaliana (Table 
2 ). 

F3’5’H belongs to the gene families of cytochrome P450. 
During the berry ripening, the transcription abundance of F3 ’5 H 
is closely related to hydroxylation level of anthocyanins. The 
proposal is that the transcription level of F3 ’5 H determines the 
ratio of cyanidin anthocyanins and delphinidin anthocyanins, and 
then influences berry skins color (Bogs et al. 2006; Jeong et al. 

Springer 


2006). 

GST has a range of functions including the transport of an¬ 
thocyanins into the vacuole. The anthocyanins is one of GSTs 
substrates in plant, such as Bz2 (type III GST) of Zea mays and 
An9 (type I plant GST) of Petunia hybrid regulated by conserved 
transcriptional activators during anthocyanins biosynthetic path¬ 
way. Coded enzyme protein may urge the anthocyanins glu¬ 
tathione and deposits in the vacuole (Alfenito et al. 1998; Wang 
and Liu 2008). Therefore GST can be considered the final en¬ 
zyme in the anthocyanins biosynthetic pathway (Alfenito et al. 
1998; Terrier et al. 2005; Ageorges et al. 2006). 

Anthocyanidin-3-glucosyltransferase is the only and key gene 
whose expression is correlated with anthocyanin biosynthesis 
(Boss et al. 1996a; Boss et al. 1996b), the expression of which 
determines whether or not anthocyanins are synthesized (Waters 
et al. 2005). 

There are some genes related to grape berry maturation in this 
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cDNA library, such as polygalacturonase ( PG ), ripening-related 
protein grip22, pectinesterase (PE), and glucosan interior con¬ 
tact-1, 3-beta-grape glycosidase and so on. PG is the cell wall 
hydrolytic enzymes that expresses specifically during berry rip¬ 
ening and participates in pectin dissolution, which is also the 
major enzyme of berry soften processes. Ripening-related protein 
grip22 gene has been proved to give rise to ripening of grape 
berries and may be related to thematic proteins, which is a class 
of proteins involved in disease resistance (Boss et al. 1996b). 
The PE increase solubility of the pectin in water in order to favor 
PG function. 

The Gene Ontology (GO) analysis has been widely used to 
characterize gene function annotation and classification (Tanguy 
et al. 2008; Uno et al. 2008). In this study, the generated ESTs 
were categorized using GO terms as shown in Table 3, which 
provide a structured vocabulary to describe a sequence according 
to its cellular component, molecular function and biological 
process. Most ESTs appeared to be related to physiological 
processes. The result showed that the ripening process of V. 
amurensis is a complex physiological process. 


Table 3 Gene Ontology category of ESTs in V. amurensis library 


Gene Ontology term 

No. clones sequenced 

Cell component 

Cell 

93 

protein complex 

37 

organelle 

41 

extracellular region 

2 

Molecular Function 

antioxidant activity 

2 

catalytic activity 

143 

binding 

127 

structural molecule activity 

24 

transcription regulator activity 

4 

transporter activity 

36 

enzyme regulator activity 

2 

nutrient reservoir activity 

3 

signal transducer activity 

4 

translation regulator activity 

6 

Biological process 

cellular process 

156 

physiological process 

186 

regulation of biological process 

8 

response to stimulus 

13 

reproduction 

1 


Conclusion 

In this study we described the construction of a cDNA library 
from V. amurensis and obtained 935 high quality EST sequences. 
These genes showed diverse functions involved in cellular com¬ 
ponent, molecular function and biological processes according to 
GO annotation. Our results suggested that the ripening of V. 
amurensis is a complex process containing multiple physiologi¬ 


cal and metabolic pathways. Our ESTs analysis can be helpful 
for further molecular mechanisms and genetic studies of V. 
amurensis. 
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